Chapter 2 



Heuristic Formulation of 
Time-Frequency Distributions 



Having established the basic signal formulations in the first chapter, we now turn 
to the problem of representing signals in a joint time-frequency domain. Given 
an analytic signal z(t) obtained from a real signal s(t), we seek to construct a 
time- frequency distribution p z (t , /) to represent precisely the energy, temporal and 
spectral characteristics of the signal. We choose the symbol p z in the expectation 
that the TFD will represent an “energy density of z” in the (t, /) plane. We would 
also like the constant-f cross-section of p z (t,f ) to be some sort of “instantaneous 
spectrum” at time t. 

In this chapter we examine a variety of ad hoc approaches to the problem, namely 
the Wigner-Ville distribution (Section 2.1), a time-varying power spectral density 
called the Wigner-Ville Spectrum (2.2), localized forms of the Fourier Transform 
(2.3), filter banks (2.4), Page’s instantaneous power spectrum (2.5), and related 
energy densities (2.6). Finally (in Section 2.7), we show how all these distributions 
are related to the first-mentioned Wigner-Ville distribution, thus setting the scene 
for the more systematic treatment in the next chapter. 

The various distributions are illustrated using a linear FM asymptotic signal. 
The linear FM signal [Eq. (1.1.5)] is regarded as the most basic test signal for TFDs 
because it is the simplest example of a signal whose frequency content varies with 
time. It is clearly monocomponent, and is asymptotic if its BT product is large. 
The minimum requirement for a useful TFD is that it clearly shows the IF law of 
an asymptotic linear FM signal, giving a reasonable concentration of energy about 
the IF law (which, for an asymptotic signal, is equivalent to the TD law). 
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2.1 Method 1: The Wigner-Ville Distribution 

For a monocomponent signal, it is reasonable to expect that the TFD should take 
the form of a knife-edge ridge whose crest is a graph of the IF law in the (t, f) plane. 
Mathematically we idealize the “knife edge” as a delta function w.r.t. frequency. 

2.1.1 Knife-Edge IF Indication 

Noting that p z is a function of frequency f and represents a kind of spectrum, we 
may reasonably require p z to be the FT of some function related to the signal. Let 
us call this function the signal kernel and give it the symbol K z . So the signal 
kernel can be written as K z (t,r), and the TFD is 

Pz(t,f)= F{K x (t,T)}. (2.1.1) 

r — ► / 

2.1.2 Formulation of the Signal Kernel 

To find a suitable form for K z (t,r), for simplicity, let us first consider the case of 
the unit-amplitude monocomponent FM signal 

z(t) = e^ (t) (2.1.2) 

whose instantaneous frequency is 

= (2-1.3) 

We would like the TFD of z(t) at any given time to be a unit delta function at 
the instantaneous frequency, so that the “instantaneous spectrum” reduces to the 
ordinary Fourier transform in the constant-frequency case; that is, we want 



Pz(t,f) = 5{f-fi(t)). 

Substituting this into Eq. (2.1.1) and taking the IFT, we obtain 
K z (t, t ) = T- 1 {6(f - /<(*))} = 

T*—f 

— gj&'W'T 



Because in Eq. (2.1.5) is not directly available, we write 
m = lim , 



(2.1.4) 



(2.1.5) 



( 2 . 1 . 6 ) 



and use the approximation 

4>'{t) « \ [4>{t+ I) - <£(*- I)] 



(2.1.7) 




Method 1: The Wigner-Ville Distribution 



31 



which is called the central finite-difference (CFD) approximation [1,2]. Substi- 
tuting Eq. (2.1.7) into Eq. (2.1.5) and using Eq. (2.1.2) gives the signal kernel 

K z (t, t) = e m+ ^ e- m -^ 

= *(i +§)**(*-§). (2.1.8) 

2.1.3 The Wigner Distribution 

Substituting Eq. (2.1.8) into Eq. (2.1.1), we obtain 

/ OO 
-OO 

Eq. (2.1.10) is given the symbol W z (t, /), and is called the Wigner distribution 
(WD) in honor of its discoverer, 1 who derived it in 1932 in a quantum-mechanical 
context [3]. 

The approximation in Eq. (2.1.7) is exact if 0 is a linear function of t, i.e. if 4>'{t) 
is constant; it is also exact if <f>(t) is quadratic [4, p. 298], i.e. if is linear. Thus 
the WD gives an “unbiased” estimate of the IF for a complex linear FM signal. 

The constant-frequency real signal 

s(t) = cos 2nf c t (2.1.11) 

leads to the signal kernel 

K s (t,r) = s(t + §) s(t - §) 

= cos 2nf c (t + §) cos 2nf c (t - § ) 

= \ cos 2nf c T + | cos 2t: 2 f c t. (2.1.12) 

Taking Fourier transforms w.r.t. r gives the WD 

We(t,f) = \S(f — f c ) + jS(f + f c ) 

+ i[cos27r2/ c t]5(/). (2.1.13) 

The terms in S(f - F / c ) are naturally expected and arise because s(t) may be ex- 
pressed as a sum of complex sinusoids at frequencies =b/ c . The term in S(f) is an 
artifact arising because the nonlinearity of the WD causes interaction between the 
positive- and negative- frequency terms. 

P. Wigner (1902-1995) was born in Hungary, studied chemical engineering in Germany and 
eventually settled in the United States, where he specialized in mathematical physics. He was a 
joint winner of the 1963 Nobel Prize for Physics for his many contributions to particle physics, 
including his law of conservation of parity and his work on the strong nuclear force. 



(2.1.9) 

( 2 . 1 . 10 ) 
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F$=1Hz N=65 Fs=1 Hz N=65 

Time-res=1 Time-res=1 




Fig. 2.1.1: Wigner distribution (left) and Wigner-Ville distribution (right) of a real linear FM signal 
[Eq. (1.1.5)] with duration 65 samples, starting frequency 0.1 and finishing frequency 0.4 (sampling 
rate 1Hz). Note the low-frequency artifacts in the Wigner distribution, caused by interaction between 
positive and negative frequencies. 



By a similar argument, we find that the non- windowed linear FM signal 



s(t) = A cos (2tt [f 0 t + ft 2 ]) 



leads to the signal kernel 



K s {t, t ) = \A 2 cos 2nfi(t)T 



+ \ A 2 cos27r 



- + 2/gf + (rf 2 



(2.1.14) 



(2.1.15) 



where fi(t) = fo + at. Taking Fourier transforms w.r.t. r gives the WD 



f) = jA 2 5(f - m) + \A 2 5(f + 



u- 






cos 2n 






y 



(2.1.16) 



The terms in S(f =F fi(t)) are naturally expected, while the last term in the signal 
kernel gives rise to a continuum of artifacts in the WD (see Fig. 2.1.1). 

These artifacts, which greatly diminish the usefulness of the WD for real signals, 
are removed by modifying the WD with the analytic signal in accordance with the 
following. 
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2.1.4 The Wigner-Ville Distribution 

Definition 2.1.1: The Wigner-Ville distribution (WVD,) of a signal s(t), de- 
noted by W z (t,f), is defined as the WD of its analytic associate, i.e. 

W z (t, f) = JF f {z{t + |) z*(t - §)} (2.1.17) 

where z(t ) is the analytic associate of s(t). 

The name “Wigner- Ville distribution”, as opposed to “Wigner distribution”, em- 
phasizes the use of the analytic signal [5] and recognizes the contribution of Ville [6] , 
who derived the distribution in a signal-processing context in 1948. Noting that a 
signal can have a time-dependent frequency content, Ville sought an “instantaneous 
spectrum” having the attributes of an energy density (property 1 in Section 1.1.5) 
and satisfying the so-called marginal conditions: 2 

• the integral of the TFD p z (t, /) w.r.t. frequency is the instantaneous power 

m I 2 ; 

• the integral of the TFD p z {t,f) w.r.t. time is the energy spectrum \Z(f)\ 2 . 

By analogy with the conditional moments of a p.d.f., and using known relationships 
between the moments of a p.d.f. and its characteristic function, Ville was able to 
show that the distribution now known as the WVD had the desired properties [7, 
pp. 946-7]. Using Eq. (2.1.8), we obtain 

W z (t,f)= F{K z {t,r)}. (2.1.18) 

The signal kernel K z (t, r) is also called the instantaneous autocorrelation func- 
tion (IAF) of z(t). 

Furthermore, all TFDs in this Chapter, unless otherwise stated, are defined from 
the analytic associate of the signal, not from the real or “raw” signal. 

2. 1.4.1 The WVD of a Linear FM Signal 

Eq. (2.1.2) describes a constant-amplitude, infinite-duration signal. We can allow 
for non-constant amplitude and finite duration using the form 

z(t) = a(t) e^ (t) (2.1.19) 

where a(t) is real. For this signal we find 

I< z (t, t) = K a (t, t) e^ (t ’ T) (2.1.20) 

2 The name “marginal” can be explained with reference to the discrete-time, discrete-frequency 
case: if the TFD were written as a two dimensional array of discrete energies, each energy corre- 
sponding to a discrete time (vertical axis) and a discrete frequency (horizontal axis) , then the sum 
over time for each frequency could be written in the horizontal “margin” of the array, and the sum 
over frequency for each time could be written in the vertical “margin”. 
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where 

K a (t,r) = a(t+ §)a(t- §) (2.1.21) 

= (j)(t + §) - (j)(t - ■§). (2.1.22) 

If 4>(t) is quadratic (i.e. if ffit) is linear), then the CFD approximation is exact and 
gives ip{t,T) = = 27r/j(t)r , so that 

K z (t,r) = A' a (t,r)e«W T . (2.1.23) 

So K z (t,r), considered as a function of r, has a constant frequency equal to ffit); 
this reduction is called dechirping. If we let W = T f {K a {t,T)} , so that 

W a {t , /) is the WD of a(t), and take the FT of Eq. (2.1.23) w.r.t. r, we obtain 

w z (t, /) = Wait, f) * 5(f - (2.1.24) 

= W„(t,/-/i(t)). (2.1.25) 

Now K a {t,T) is real and even in r. Hence W a {t, /) is real and even in /, so that 
W z (t, /) is real and symmetrical about / = fi(t) . 

Thus, even for a finite - duration linear FM signal, such as the one analyzed in 
Fig. 1.1.3, we expect the WVD to give a clear indication of the IF law. In fact, the 
TFD shown in Fig. 1.1.3 is the WVD of a finite duration linear FM signal. 

2. 1.4. 2 The WVD in terms of the Spectrum 

The variables t, f and r are called time, frequency and lag, respectively. We now 
introduce the variable v, which represents frequency shift- just as r represents time 
shift; accordingly, v will be called Doppler . 3 
Let 

k z {v,f)= T{W z {t,f)}. (2.1.26) 

Writing out the definitions of the FT and the WVD and taking invariant factors 
inside the integral signs, we obtain 

k z {u, f) = jj z{t + ^)z*it-^)e- j2 < fT+vt) dtdT (2.1.27) 

where the integrals are from — oo to oo. If we write 

x = t + §; y = t-\ (2.1.28) 

and solve for t and r, obtaining 

t=±(x + y) ; t = x — y , (2.1.29) 

3 The well-known “Doppler effect” is actually a frequency scaling. But when the effect is used 
to measure velocity, the scaling factor is usually close to unity and the bandwidth of interest is 
usually narrow. Under these conditions, the frequency scaling is well approximated by a frequency 
shift proportional to the velocity. 
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then the use of the Jacobian yields 

dtdr = dxdy. (2.1.30) 

With these substitutions, we find that Eq. (2.1.27) can be factored into 

/ OO POO 

z(x) e- j2 ^ f+v/2]x dx ■ / z\y) e j2 ^ f - v ^ v dy 

-OO <7—00 

= (2.1.31) 

where Z(f) = T{z(t)} . Noting that k z (v, f) has a similar form to K z (t,r) in the 
Doppler- frequency domain, we describe k z (z', /) as the “spectral autocorrelation 
function”. 

Substituting Eq. (2.1.31) into Eq. (2.1.26) and taking the IFT yields an expres- 
sion for the WVD in terms of Z(f): 

/ OO 

Z(/+f)Z*(/-f)e^^. (2.1.32) 

-OO 

2. 1.4. 3 Effects of Time- and Frequency-Limiting 

A practical signal z(t) is often expected to be both time-limited and band- limited, 
despite theoretical constraints. Let us assume that z(t) is windowed in both time 
and frequency. 

For the time windowing, we can replace z(t) by 

z w (t) = z(t)w(t-t 0 ). (2.1.33) 



The WVD of z w (t) is 

/ OO 

z{t+^)w(t-t 0 + ^)z*{t-l)w*(t-t 0 -l)e- j2nfT dT. (2.1.34) 

-OO 

Putting to =t gives 

/ OO 

g 2 (r)K z (t,T)e-^ T dr (2.1.35) 

-OO 

where 

32 (t) = «>(§)«>*(-§). (2.1.36) 

If . 92 ( 1 ") = 0 for |t| > T/2 , then the limits of integration in Eq. (2.1.35) may be 
changed to ±T / 2. Notice that W Zw (t 1 f) differs from the WVD of z{t) in that the 
IAF is multiplied by g 2 (j) before being Fourier-transformed w.r.t. r. This 92 (t) is 
thus the effective lag window corresponding to the sliding time window w(t — to)- 
For the frequency windowing, we can replace Z(f) by 



Z H {f) = Z{f)H{f-fo). 



(2.1.37) 
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Using Eq. (2.1.32), we can take the WVD corresponding to Zn{f) and put /o = / , 
obtaining in a similar manner to the above: 

/ OO 

Gi(v) Z(f + f) Z*(f - f ) e^du (2.1.38) 

-OO 

where 

G 1 (v) = H(%)H*(-%). (2.1.39) 

If G i(i/) = 0 for \v\ > B/ 2, then the limits of integration in Eq. (2.1.38) may be 
changed to ±B/2. Notice that Wz H {t,f ) differs from the WVD of z(t) in that 
the spectral autocorrelation function is multiplied by G\{y) before being inverse- 
Fourier-transformed w.r.t. v. This G\(v) is the effective Doppler window corre- 
sponding to the sliding frequency window H(f — fo). 

The effects of time windowing and frequency windowing may be combined as 
multiplication by the factor 



9(v, t ) = Gi(v) g 2 {r) (2.1.40) 

in the ( v 1 r) domain. If we consider time-limiting alone, the resulting TFD is given 
by Eq. (2.1.35) and is called the windowed WVD [8-10]. If we consider band- 
limiting alone, the resulting TFD is given by Eq. (2.1.38). We shall call this the 
filtered WVD. The window shapes of G± and g 2 should be selected to ensure the 
properties 1 to 3 in Section 1.1.5 are verified. 



2.2 Method 2: Time-Varying Power Spectral Density 
2.2.1 Spectra of Non-Stationary Random Processes 

If z(t) is a complex random signal, its symmetrical autocorrelation function is 
defined as 

K z (t,r)=£{z(t+%)z*(t-%)} (2.2.1) 

where £{•■•} denotes the expected value. If z(t) is wide-sense stationary, then 
7 Z z (t,r) is independent of t. and the limit 



S z (f) = lim 8 

T — >oo 



l 

T 



F f {z(t) rect (t/T)} 



( 2 . 2 . 2 ) 



is called the power spectral density (PSD) of the random process z(t), and 
describes the distribution of signal power over the frequencies. The PSD is related 
to the autocorrelation function by the Wiener-Khintchine theorem, which states 
that 

Sz(f) = jF f {n x (t, T )}. 



(2.2.3) 
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If z(t) is not wide-sense stationary, the right-hand side of this equation depends on 
t, so that the left-hand side also depends on t, suggesting the generalization 

S z (t,f)= JF f {H z (t,r)}. (2.2.4) 

This S(t, f) may be called the time- varying spectrum or evolutive spectrum (ES), 
and is interpreted as the “time-dependent PSD” of the non-stationary signal. The 
right-hand side of Eq. (2.2.4) may be expanded as 

/ OO 

£{z(t + § ) z*{t - §)} e~ j2nfT dT (2.2.5) 

-OO 

= £ j f z(t + § ) z*(t - §)e- J ' 2w/r dr| (2.2.6) 

S z (t,f) = £{W z (tJ)}. (2.2.7) 

Eq. (2.2.7) shows that the expected value of the WVD is the FT of the time- 
dependent autocorrelation function [11]; that is, the ES is the expected value of 
the WVD. For this reason, the ES is also called the Wigner-Ville spectrum. 

If z(t) is deterministic, Eq. (2.2.7) reduces to 

S z (t,f) = W z (t,f). (2.2.8) 



2.2.2 Estimating the Wigner-Ville Spectrum 

Eq. (2.2.7) refers to an ensemble average of the random process W z (t,f). If we 
have only one realization of this process, we may be able to assume that W z {0^ f) 
is locally ergodic on the interval t — A/2<0<f + A/2, where A is positive and 
independent of t. First we calculate an estimate of W z (t, /) for the local values of 
z(t), yielding 

W z (t,f) = T {g 2 (T)K z (t,T)} (2.2.9) 

= G 2 (f)*W z (t,f) (2.2.10) 



where g 2 (r), the effective analysis window, is real, even and time- limited, and 
G 2 {f)=T{g 2 {r)}. 

Then, to obtain an estimate of S z (t,f), we replace the ensemble average over 
all realizations of W z (t, f) by a time average of W z (t, f) over the interval A. We 
can calculate such an average using a real, even weighting function gi(t), with the 
properties 



f > 0 if |t| < A/2 
[ = 0 otherwise 



( 2 . 2 . 11 ) 



/*oo 

/ gi(t)dt = l . 



— oo 



( 2 . 2 . 12 ) 
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The resulting estimate of S z (t, f ) is 





/»oo 

&(*,/)=/ gi (O-t)W z (6,f)d0. 

J — oo 


(2.2.13) 


Using the evenness of g\ and substituting from Eq. (2.2.10), we have 






= gi(t) *w z (t,f) = gi(t) * G 2 (f) * W z (t, /). 


(2.2.14) 


This can be written 


§z{t, f) = 7 (t, f) ** W z (t, /) 


(2.2.15) 


where 


7 (tj) =9i(t) G 2 (/) 


(2.2.16) 



and the double asterisk denotes double convolution. 

Eq. (2.2.15) defines a class of estimates for time-varying spectra obtained by a 
double convolution of the WVD with a 2D filter. This will be used in the next 
chapter to define quadratic TFDs with specific properties. 



2.3 Method 3: Windowed Fourier Transform (STFT, Spectro- 
gram & Gabor Transform) 

2.3.1 STFT and Spectrogram 

Consider a signal s(r) and a real, even window w(t), whose FTs are S(f) and 
W(f) respectively. To obtain a localized spectrum of s(r) at time r = t , multiply 
the signal by the window w(t) centered at time r = t , obtaining 



S w {t, T ) = s(t ) w(t - t) , 


(2.3.1) 


and then take the FT w.r.t. r, obtaining 




K (t, /) = wiyT ~ 


(2.3.2) 


F™(t,f) is called the short-time Fourier transform (STFT). 

The squared magnitude of the STFT, denoted by S™ (t, /) , is 

trogram: 


called the spec- 


S7(tJ) = \F™ (t, /) 2 

o 


(2.3.3) 


z 

= T {s(r)w(r-t)} 

T—*f 


(2.3.4) 


nOO 2 

= / s(r) w(t — t ) e~F n f T d,T . 

J — OO 


(2.3.5) 



In the notation S™(t, /), the upper-case S stands for “spectrogram”, while the sub- 
script stands for the signal and the superscript for the filter. 
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Fig. 2.3.1: Spectrogram of a sinusoidal FM signal [Eq. (1.1.4)] with 65 samples (sampling rate 1Hz), 
A = 1 , f c = 1/4 , / m = 3/64 , fd = 6/64 , rectangular window of length A samples: (a) A = 63 ; 
(b) A = 23; (c) A = 7. 



The observation window w(t) allows localization of the spectrum in time, but 
also smears the spectrum in frequency 4 in accordance with the “uncertainty relation- 
ship” [12], leading to a trade-off between time resolution and frequency resolution. 
The problem is illustrated in Fig. 2.3.1, which shows the spectrogram of a sinusoidal 
FM signal for a rectangular window of three different lengths. If the window is long 
compared with the modulating signal, the frequency resolution is sufficient to show 
the sideband tones (the “multicomponent aspect” [13] of the signal), but the time 
resolution is insufficient to show the FM law (the “monocomponent aspect”). If 
the window is short compared with the modulating signal, the time resolution is 
sufficient to show the FM law but the frequency resolution is insufficient to show 
the sideband tones. 

The spectrogram is nonlinear; but the nonlinearity is introduced only in the fi- 
nal step (taking the squared magnitude) and therefore does not lead to undesirable 
artifacts present in other TFDs. This freedom from artifacts, together with sim- 
plicity, robustness and ease of interpretation, has made the spectrogram a popular 
tool for speech analysis (resolution of speech into phonemes and formants) since its 
invention in 1946 [14]. 

2.3.2 Optimal Window Length of the Spectrogram 

The spectrogram involves a compromise between time resolution and frequency res- 
olution: a longer window provides less localization in time and more discrimination 
in frequency. 

4 Smearing is caused by the convolution operation. If an image (a function of two coordinates) 
is convolved with a confusion pattern (another function of the same two coordinates) , the result is 
a blurred image. If the confusion pattern is a line, we tend to describe the blurring as a “smearing”. 
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Fig. 2.3.2: Spectrogram of a linear FM signal [Eq. (1.1.5)] with duration 65, starting frequency 0.1 and 
finishing frequency 0.4, for a rectangular window of length A: (a) A = 33 ; (b) A = 21 ; (c) A = 11 . 
The optimal window length according to Eq. (2.3.6) is 20.7. 



The purpose of the window is to obtain a time-slice of the signal during which 
the spectral characteristics are nearly constant. If the window is too long, it fails to 
capture the most rapid variations of spectral content. If it is too short, it smears the 
TFD in the frequency dimension without a commensurate improvement in detail in 
the time dimension. The more rapidly the spectral content changes, the shorter the 
window must be. 

Hence, for a monocomponent signal of constant amplitude, the optimal window 
duration is inversely related to the rate of change of the IF. More precisely, if the 
window is rectangular and has total duration A, it can be shown [15, 16] that the 
value 



A = V2 



dfi(t) 



dt 



- 1/2 



(2.3.6) 



is optimal in the sense that it minimizes the half-height width of the resulting ridge in 
the (f , /) plane. This optimal A is proportional to the relaxation time T„ ; compare 
it with Eq. (1.3.44). For a linear FM signal, the optimal window duration simplifies 
to A = \J2T /B , where T is the signal duration and B is the signal bandwidth [17]. 
Fig. 2.3.2 shows the spectrogram of a chirp signal (the same signal as in Fig. 1.1.3) 
for a rectangular window of three different lengths, one of which (A = 21) is optimal 
according to Eq. (2.3.6). 

Even for the optimal window length, the spectrogram is not a delta function 
describing the IF law. The use of this optimal window is inconvenient because it 
requires knowledge of the IF, and this knowledge might be obtainable only by some 
sort of time-frequency analysis. Moreover, if the IF law is nonlinear, the optimal 
window duration varies with time. In the case of the sinusoid FM signal of Fig. 2.3.1, 
the optimal window length is time- varying if the signal is considered as a modulated 
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carrier. Although it is possible to vary the spectrogram window length with time and 
even with frequency, such procedures have a cost in computational efficiency [18]. 
A recent iterative algorithm for matching the spectrogram window to the estimated 
IF, so that the spectrogram of a monocomponent signal is concentrated along the 
IF law, is described in [19] and in Article 10.1. 

2.3.3 STFT vs. Gabor Transform 

In 1946, while studying the requirements for efficient signal transmission, Gabor [12] 
noted that the (f , /) plane can be divided into an array of rectangles using a bank of 
filters, each of which is switched on for a single interval of time and passes a single 
band of frequencies. Each rectangle was called a logon, and its dimensions were 
called the decay time and the tuning width. Gabor noted that the dimensions 
must satisfy the Heisenberg uncertainty relation 

AtAf > y- (2.3.7) 

47 r 

where At and A f are the effective duration and bandwidth of the logon [12]. Gabor 
showed this relationship to be “at the root of the fundamental principle of commu- 
nication” [12], in that it puts a lower limit on the spread of a signal in time and 
frequency. 5 For the minimum elemental area, which is obtained in the case of a 
complex Gaussian signal, Eq. (2.3.7) becomes an equality as in Eq. (1.2.34). 

In Gabor’s representation, each logon is assigned a complex coefficient c n ,k , 
where n is the time index and k the frequency index. The signal s(t) is expanded 
in the doubly infinite series 



s(t) = ^2 °n,k ipn,k{t) (2.3.8) 

n,k 

where the sum is over all integers n and k, and is a function centered about 

time nAt and frequency kAf. To find the coefficients c n ,k , let h nt k{t) and ip n ,k(t) 
be related by 




Knj{t) ip n , k (t)dt = 



1 if m = n and l = k ; 
0 otherwise . 



(2.3.9) 



5 Eq. (2.3.7) is perhaps best known for its corollary in quantum mechanics. If we write x 
(position) for £, and k (wave number) for 2nf, we obtain 

AxAk > 1/2. 

De Broglie’s “matter wave” relation may be written k = 2np/h , where p is the momentum and h 
is Planck’s constant. Making this substitution for k in the above equation, we obtain 

A . h 
Ax A p > — . 

47T 

which, for some readers, will be more familiar than Eq. (2.3.7). 
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In other words, let h m ,i{t) be orthogonal to every ip n ,k(t) except ipm,i{t), or, equiv- 
alently, let tp m> i(t) be orthogonal to every h n ^(t) except h m ,i{t ); functions related 
in this way are called dual functions. Multiplying Eq. (2.3.8) by t (t) and inte- 
grating w.r.t. t, we obtain an expression for c n: k' 

/ OO 

s(t) h* n k (r) dr. (2.3.10) 

-OO 

If we choose 

h n ,k(T) = w(t — nAt) e j27rfeA ^ T (2.3.11) 

where w denotes a real Gaussian function, then Eq. (2.3.10) becomes 



Cn,k 



/ oo 

s(t ) w(t — nAt) e _j27rfcA ^' r dr 

-OO 



T \s(t) w(t — nAt)}. 

— , fc A/ L 



(2.3.12) 

(2.3.13) 



Expression (2.3.13) is known as the Gabor transform. When ip n , kit) is chosen 
as the dual function of h nt k{t), Eq. (2.3.8) gives the inverse Gabor transform. 
Eq. (2.3.13) has the same form as Eq. (2.3.2) except that t and / are discretized, 
wherefore the Gabor transform has been described as a sampled STFT. Suitable 
selections of ip and h need to be made for the TFD to verify the properties 1 to 3 
in Section 1.1.5, at least approximately. 

Lerner [20] extended Gabor’s work by allowing the elements of the (t, /) plane 
to be non-rectangular. Helstrom [21] generalized the expansion by replacing the 
discrete elementary cell weighting with a continuous function £(r, t, /). Wavelet 
theory [22] was developed later as a further extension of Gabor’s work, but choosing 
the partitions of the (f, /) plane so as to give constant-Q filtering [23,24], 



2.4 Method 4: Filtered Function of Time 
2.4.1 Filter Banks and the Sonograph 

Whereas the spectrogram is conceived as a function of frequency with dependence 
on the timing of a window, the sonograph is conceived as a function of time with 
dependence on the tuning of a filter. Consider a signal s(t) with spectrum S{v), 
and a lowpass filter with the real impulse response h{t) and the transfer function 
H{v). To extract the bandpass “component” of s(t) at frequency v = /, we shift 
the filter function in frequency so that it is centered at v = / , multiply the signal 
spectrum by the shifted filter function, and take the inverse FT w.r.t. n, obtaining 

B?{t, f) = JF-i {SM H{v - /)} (2.4.1) 

where the “13” stands for “bandpass”. This signal may be understood as the complex 
output of a bandpass filter with center frequency +/; the input s(t) may be recon- 
structed as the sum of the outputs of such filter banks whose transfer functions 
add up to unity within the bandwidth of s(t). 
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The squared magnitude of Bf(t,f) is denoted by Si 

sonograph or sonogram: 



(t, /) and is called the 



S { s H) {t,f ) 






s(y)H{v-f )} 

/ oo 

5(1/) H(v - /) eP^dv 

-OO 



(2.4.2) 

(2.4.3) 

(2.4.4) 



For the sonograph, the optimal bandwidth of the band-pass filter is related to the 
time delay in the same way that the optimal window length for the spectrogram is 
related to the instantaneous frequency. 

Like the spectrogram, the sonograph is nonlinear, but the nonlinearity is in- 
troduced only in the final step and does not lead to artifacts. Filter banks — the 
practical realization of the sonograph have long been used in music broadcast- 
ing, recording and even home entertainment, although the frequency division is 
not necessarily linear; for example, graphic equalizers and analyzers usually have 
logarithmic frequency division. 



2.4.2 Equivalence to Spectrogram 



Theorem 2.4.1: The spectrogram and sonograph are equal if the window function 
of the spectrogram is real and even and equal to the impulse response of the sonograph 
filter for / = 0 . 

Proof: Applying the inverse convolution property to Eq. (2.4.1), we obtain 






s(t) * h(t)e^ 2n B 

/ OO 

s(t) h(t — t) d T 

-oo 



_J'2t r/i 



T F f { s (r) h(t - T)} 



(2.4.5) 

(2.4.6) 

(2.4.7) 



which yields 






^{s(r) h(t 




(2.4.8) 



Comparing Eqs. (2.3.4) and (2.4.8), we see that the spectrogram and sonograph are the 
same if 



h{t) = w(—t) , (2.4.9) 

which is the case if w(t) is even and equal to h(t). The condition that w(t) be real is 
redundant in the proof, but is assumed in the definition of the spectrogram. ■ 



2.5 Method 5: Instantaneous Power Spectra 
2.5.1 Page Distribution 

This approach attempts to define a kind of “running spectrum” by considering the 
variations of the signal spectrum as time t increases. 
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Given a signal s(t), let us define the running transform SL(f, /) as the FT of 
the signal s up to time t. To do this, we first define the “auxiliary signal” s t (9) as 



s t {0) 



s(&) if 9 < t 

0 if 9 > t . 



Then the running transform is simply 



S-(t,f) = JF{s t (6)} 

0—*J 




s(0) e~ j2 * f8 d9. 



(2.5.1) 



(2.5.2) 



As the normal energy spectrum is the squared magnitude of the FT, so the running 
energy spectrum up to time t, denoted by e s (t,f), is the squared magnitude of 
the running transform: 

e s (t,f) = \S-(t,f)\ 2 =S.(tJ)S*_(t,f). (2.5.3) 



Differentiating each expression in this equation w.r.t. time, and denoting the time- 
derivative of e s (t, f) by P s (t, /), we obtain 



P s(t,f) 



|[|S-(*,/)l 2 ] 






(2.5.4) 

(2.5.5) 



This P being the time-derivative of a time-dependent energy spectrum, may 
be understood as a kind of time- dependent power spectrum. It is now known as the 
Page distribution in honor of its discoverer [25]. 

By substituting the right-hand expression of Eq. (2.5.2) into Eq. (2.5.4), we ob- 
tain 



P s(t,f) 



d_ 

dt 




21 



s{9) e~^ f0 d9 



(2.5.6) 



which is the usual definition of the Page distribution. Using Eq. (2.5.2) to evaluate 
the partial derivatives in Eq. (2.5.5), we obtain the alternative expression 



or, substituting from Eq. (2.5.2) and writing r = t — 9 , 

P s (t,f) = 2Re|y s* (t) s{t - t) e 32 *^ dr^ . 
If s{t) is real as Page assumed [25], Eq. (2.5.8) becomes 

pOO 

p s{t,f) = 2 / s(t.) s{t - t) cos(27t/t) dr. 

Jo 



(2.5.7) 



(2.5.8) 



(2.5.9) 
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The Page distribution of a time- limited linear FM signal is shown in Fig. 2.7.1(c). 
Notice that the distribution can take negative values, which are inconsistent with 
the notion of an energy distribution, but perfectly consistent with the notion of 
an energy gradient as defined in Eq. (2.5.6). As negative values compensate for 
earlier spurious positive values caused by the truncation of s(t) to produce St(0), 
the presence of negative values implies that energy is delocalized in the ( t , /) domain. 



2.6 Method 6: Energy Density 
2.6.1 Rihaczek’s Complex Energy Density 

In search of a TFD localized in both time and frequency, Rihaczek [26] considered 
the energy of a complex deterministic signal over finite ranges of t and f, and allowed 
those ranges to become infinitesimal, obtaining what he called a complex energy 
density. 

Here we offer a simpler derivation than that given by Rihaczek. The energy of 
a complex signal z(t), with Fourier transform Z(f), is 



E = 



' — OO 
nOO nOO 



z(t)\ 2 dt = / z{t)z*{t)dt = 

J — OO 

R z(t,f)dtdf 



/ OO 

zy^e-^dfdt 

-oo 



' — oo*/ — oo 



where R z (t, /), the energy density function, is defined by 

R z (t,f) = z(t)Z*(f)e~^ t . 



( 2 . 6 . 1 ) 



( 2 . 6 . 2 ) 



This R z (t, f) is the Rihaczek distribution (RD). If we express Z(f) and hence 
Z*{f) in terms of z(A) and use the substitution t = t — X , we obtain the alternative 
form 

nOO 



R z(t,f)= / z{t) z*(t — r) e j2nfT dr. 



(2.6.3) 



A distribution equivalent to Rihaczek’s was derived earlier, in the context of 
quantum mechanics, by J. G. Kirkwood [27], so that the RD is also called the 

Kirkwood-Rihaczek distribution [28, p. 26]. 

From Eq. (2.6.2) it is easily verified that 

/ OO 

Rz(t,f)df = \z(t)\ 2 (2.6.4) 

-OO 

/ OO 

R z (t,f)dt = \Z(f)\ 2 . (2.6.5) 

-OO 

That is, the RD satisfies the marginal conditions [see the discussion following 
Eq. (2.1.17)]. 
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Integrating Eq. (2.6.4) w.r.t. t and Eq. (2.6.5) w.r.t. /, we obtain respectively 





Rz(t,f)dfdt 




( 2 . 6 . 6 ) 



rf2 poo pf2 

/ / r z (t,f)dtdf= / \Z(f)\ 2 df. (2.6.7) 

j fl J-OO j fl 

The right-hand side of Eq. (2.6.6) is the energy in the time interval between t\ and 
t '2 , while the right-hand side of Eq. (2.6.7) is the energy in the frequency band 
between /i and Together, the two equations indicate that R z (t, f) can be in- 
terpreted as an energy density over an arbitrary time interval and an arbitrary 
frequency band. 

In Eq. (2.6.2), the RD has the signal z(t) as a factor. It follows that the RD is 
zero at those times when z(t) is zero; this property is called strong time support. 
Similarly we see that the RD is zero at those frequencies for which the spectrum 
Z(f) is zero; this property is called strong frequency support. 



2.6.2 Levin’s Real Energy Density 

The Levin distribution (LD) is simply the real paid of the RD. It follows that 
the LD, like the RD, has strong time support and strong frequency support. Taking 
the real parts of Eqs. (2.6.4) to (2.6.5), we further conclude that the LD satisfies 
the marginal conditions and their corollaries. 

Let the LD of the complex signal z(t) be denoted by L z (t,f), Taking the real 
part of Eq. (2.6.2), we obtain the definition 

L z (t, f) = Re{z(t) Z*(f) e-™*} . (2.6.8) 

Taking the real part of Eq. (2.6.3) yields the alternative expression 



L z {t,f) = Re 



J z(t)z*(t — r)e ^ 2 ^ T dr | 



(2.6.9) 



If z(t) is replaced by a real signal s(t), Eq. (2.6.9) reduces to the cosine form obtained 
by Levin [29]. 

Historically, the distribution obtained by Levin was a modification of the Page 
distribution and a precursor of the RD. But it was first discovered in a quantum- 
mechanical context by Margenau and Hill [30]; so it is also called the Margenau- 
Hill distribution [28, p. 26]. 



2.6.3 Windowed Rihaczek and Levin Distributions 

Because energy is a real quantity, the real part of the RD is more significant than 
the imaginary part or the magnitude. Hence a “plot of the RD” is usually a plot of 
the real part, i.e. the LD. Fig. 2.7.1(d) shows such a plot for a time-limited linear 
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FM signal. Although Eqs. (2.6.6) and (2.6.7) indicate that the energy of the TFD is 
well localized in the time and frequency dimensions separately, i.e. in strips parallel 
to the / and t axes, it does not follow that the energy is well localized in both 
dimensions at once. Indeed Fig. 2.7.1(d) shows that the peaks of the TFD are not 
confined to the IF law, but show many spurious features. The WVD of this signal 
[Fig. 2.7.1(a)] is much cleaner. Because the RD/LD performs so poorly on such a 
simple signal, it must be regarded as only of theoretical interest. 

By comparison with the RD, the spectrogram is remarkably free of artifacts. 
Recall that the spectrogram is the squared magnitude of the STFT. So one way to 
reduce artifacts in the RD is to introduce the STFT as a factor [31] instead of the 
spectrum Z(f ), in Eq. (2.6.2). The resulting distribution, which might be called the 
windowed Rihaczek distribution, is 

Pz(t, f) = z(t) JF f {z(r)w(T - t)} (2.6.10) 

where w is the window function of the STFT. As the real part of the RD is the LD, 
we might as well designate the real part of the windowed RD as the windowed 
Levin distribution. A windowed LD of a time-limited linear FM signal is shown 
in Fig. 2.7.1(e); note the reduction in artifacts compared with the conventional LD. 

From the form of Eq. (2.6.10), we see that the windowed RD and the windowed 
LD have strong time support. Other properties of these distributions will emerge 
in later sections. 

2.7 Relationship between TFDs 

So far, we have considered six different approaches to defining a TFD. All the 
approaches seem natural and reasonable, and yet lead paradoxically to at least five 
different definitions of a TFD, all of them quadratic in the signal. Using the “signal 
kernel” approach [Eq. (2.1.1)], we may write for any TFD p z (t, /): 

Pz(t,f)= f{R z {t,T)}. (2.7.1) 

T-*f 

where R z (t,r) is found simply by taking the inverse FT of p z (t,f) w.r.t. /. In 
the case of the WVD, R z (t,r) is the instantaneous autocorrelation function (IAF), 
denoted by K z (t, r). In other cases we shall call i? z (t, r) the smoothed IAF. The 
reason for this term will become apparent as we relate R z (t,r) to K z (t,r ) for each 
of the TFDs that we have defined. 

2.7.1 Spectrogram 

From Eq. (2.3.4), the spectrogram with window function w can be rewritten in the 
generalized notation as 

Pz(t, f) = T {z(T)w(T-t)}\ T {z(r)w(r-t)} 1 . 

T— *■/ lr—> yf J 



(2.7.2) 
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Taking the IFT (/ — > r) of both sides gives 



R z {t, T ) = [z(T)w(T-t)]*[z*(-T)w*(-T-t)] 

/ OO 

2(A) w(X—t) z*(X—t) u*(X—T — t) dX 

-00 

/ OO 

z(u+%) w(u—t + §) z*(u — 77) w*(u — t — |) du 

-OO 



(2.7.3) 



where A = u+§ is the dummy variable in the convolution. Exploiting the evenness 
of w, this can be written 

/ OO 

w*(t—u+^) w(t — u — 7^) z(u+^) z*(u — |) du 

-OO 

= G(t,T)*K z (t,T) (2.7.4) 

where 

G(t,r) = io*(t + ^)«;(t — J) (2.7.5) 

and K z (t,r ) is given by Eq. (2.1.8). G(t,r) is called the time-lag kernel. 6 
Eq. (2.7.4) defines the time-lag kernel as that which must be convolved in time with 
the IAF to obtain the smoothed IAF ; the word “smoothed” refers to the convolution. 
We shall adopt this definition for all TFDs given by Eq. (2.7.1). 

Eq. (2.7.5) gives the time-lag kernel for the spectrogram. As w is real, we may 
interchange w and w* , leading to the conclusion that the time-lag kernel for the 
spectrogram is simply the IAF of the observation window function. 



2.7.2 Wigner-Ville Distribution 

The “smoothed IAF” for the WVD is 



R z (t,r ) = K z (t,r) = 8{t) * K z (t, t ) (2.7.6) 

so that 

G(t, t) = 8{t). (2.7.7) 

In this trivial case, the “smoothing” makes no difference. 

2.7.3 Rihaczek Distribution 

Eq. (2.6.3) may be written 

p z (t,f) = JF f {z(t)z*(t-T)}. (2.7.8) 



6 The term “kernel” was used in this sense by Claasen and Mecklenbrauker [32]. To minimize 
the risk of confusion between the “time-lag kernel” G(t,r) and the “signal kernel” K z (t,r), the 
latter is usually called the IAF in this book. 
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Taking the IFT gives 

R z (t, t ) = z(t) z*(t — t) (2.7.9) 

= W - f) *[*(*+ 5) **(*" 5)1 (2. 7. to) 

= 6(t-Z)*K z (t,T) (2.7.11) 

so that 

G(i,T)=5(i-§). (2.7.12) 

2.7.4 Levin Distribution 

Using Eq. (2.7.8) as the definition of the Rihaczek distribution, and taking the real 



part, we obtain for the Levin distribution 

Pz{t,f) = Rejj^i) **(*-■»-)}} (2.7.13) 

= hjF f i z ( t ) z *(t-T)} + \ JF f {z(t)z*(t-T)} . (2.7.14) 

Taking the IFT, we obtain 

R z {t, t) = \ z(t) z* (t - t) + \ z* (t) z(t + t) (2.7.15) 

= \5{t-%)*K z {t,T) + \8{t+%)*K z {t,T) 

= ^[8(t+^) + 8(t~l)\*K z (t,T) (2.7.16) 

so that 

G(i,r) = §[«(*+ §) + $(*-§)]. (2.7.17) 

2.7.5 Windowed Rihaczek Distribution 

Eq. (2.6.10) can be written 

Pz (tJ) = z(t)JF f {z*(- T )w*(-T-t) }e-^t (2.7.18) 

= z(t) F {z*{t-T)w*{-T)}. (2.7.19) 

Taking the IFT gives 

R z (t,r) = z(t) z*(t — t) w*(— t) (2.7.20) 

= [6{t-Z)*K z (t,T)]w*(-T) (2.7.21) 

= [w*(-T)S(t-^)}*K z (t,T) (2.7.22) 



so that 



G(t,r) =w*(-T)S(t- §). 



(2.7.23) 
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Because w is real and even, this reduces to 

G{t,r) =w(r)6(t- §). (2.7.24) 

Comparing Eqs. (2.7.11) and (2.7.22), we see that the smoothed IAFs of the RD 
and the windowed RD differ by the factor w*(— r); that is, the distributions differ 
by a windowing operation in the lag domain prior to Fourier transformation from 
t to f [31]. 

2.7.6 Windowed Levin Distribution 

From Eq. (2.7.19), the real part of the windowed RD is 

h z (hf f {z*( t - T ^ w *F T )} + h z *(t)JF f {z*(t-T)w*(-T ) } . (2.7.25) 

Taking the IFT, we find that the smoothed IAF of the windowed LD is 

R z (t, T ) = \ z(t) z*(t. - t) w*(-t) + \ z*(t) z(t + t) w(t) 

= 3 + \ [S(t+%)* K z (t,T)}w(r) 

= ^[w(T)5{t+^)+w*(-T)6(t-^)]*K z (t,T) (2.7.26) 

so that 

G(t, t) = \ [ty(r) S(t+ \ ) + w*(—t ) 8(t— §)] (2.7.27) 

or, because w is real and even, 

G(t, t) = \ w(t ) [<5(t + f ) + 5(t - \ )]. (2.7.28) 

Comparing Eqs. (2.7.16) and (2.7.26), and noting that w is real and even, we see 
that the smoothed IAFs of the LD and the windowed LD differ by the factor w(t); 
that is, the distributions differ by a windowing operation in the lag domain [31]. 



2.7.7 Page Distribution 



Rewriting Eq. (2.5.8) using the unit step function u(t), we obtain for the Page 
distribution 



p z (t, f) = 2 Re j J z*(t) z(t- A) u( A) e i2,r/A d\ J . (2.7.29) 



With the substitution r = —A, this becomes 



Pz{t,f) = 2Rej^{2*(f)2(f + t ) u (— r)} j 

= JF f {z*{t)z(t+ T ) u (-T)} + JF^{z*(t) z(t+r) «(— t )} . 



(2.7.30) 

(2.7.31) 
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Taking the IFT, we obtain 

R z (t,r) = z*(t) z(t+r) u(—t) + z(t) z*(t—r) u(t) (2.7.32) 

= u(-T)[S(t+^)*K z (t,T)\ + u(T)[6(t-^)*K z (t,T)] (2.7.33) 

= [u(-T)5(t+%) + u(T)5(t-%)]* K z (t,r) (2.7.34) 

so that 

G(t,r ) = u(— t) S(t + ^) + u(t) S(t — ^) (2.7.35) 

= W-Iil)- (2-7-36) 

2.7.8 Relationship between the WVD and Other TFDs 

By taking the FT of Eq. (2.7.4) w.r.t. r and using specific forms for G(t, r), all the 
considered TFDs can be written in the same form as Eq. (2.2.15): 

p z (tJ)=7(t,f)**W z (t,f) (2.7.37) 

where 

7 (t,f)= (2.7.38) 

T— ►/ 

is the TFD time-frequency kernel. 

This then suggests that all the TFDs naturally introduced so far can be consid- 
ered to be smoothed WVDs. This observation led to the design or rediscovery of 
several other TFDs that are briefly discussed next. 

2.7.9 Other Popular TFDs 

Having derived some TFDs by intuitive methods and then determined their time- 
lag kernels, let us now define a few more TFDs directly in terms of their time-lag 
kernels. 



Name 


G(t,r) 


Windowed WVD (io-WVD) 


S(t) w(t) 


Sine or Born-Jordan (BJ) 


|2ar| reCt 2ar 


Exponential or Choi-Williams (CW) 


x/ttct -7i- 2 <Tt 2 /r 2 

M e 


Windowed sine or Zhao-Atlas-Marks (ZAM) 


W ( T ) rect 2^7 


B-distribution (BD) 


r \P cosh ~ 2/3 t 


Modified B-distribution (MBD) 


cosh - 2/3 t 


f^° o oCosh -2 ^ £ d£ 



In the above table, the parameters a, er, a and j3 are real and positive, and w(t) is a 
window function. The CW distribution was defined in [33] and the ZAM distribution 
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in [34]. The BJ distribution, called the “sine distribution” in [28, p. 26], was defined 
in [7,35] using an operational rule of Born and Jordan [36, p. 873]. Observe that the 
BJ distribution is a special case of the ZAM with a = 1/a and w(t) = a/|2r|. These 
distributions are illustrated for a linear FM signal in Fig. 2.7.1, parts (f)-(i). The 
BD, w-WVD, and MBD kernels are “separable”, while the ic-WVD kernel is also 
“Doppler-independent” and the MBD kernel is also “lag-independent”; these con- 
cepts are discussed later, especially in Article 5.7. In Fig. 2.7.1 parts (j)-(l) we have 
respectively plotted a Doppler- independent, a lag- independent, and a separable- 
kernel TFD of a linear FM signal. We shall see [e.g. in Table 3.3.3 on p. 77] that 
the separable and Doppler (lag)-independent kernels have been introduced in order 
to obtain certain desired TFD properties. 

For convenience, Table 2.7.1 collects and tabulates the definitions of the TFDs 
that have been derived heuristically in this chapter, and of the six TFDs that have 
just been defined in terms of their time-lag kernels. 

2.8 Summary and Discussion 

Constructing a quadratic TFD from the analytic associate of a given real signal, 
rather than from the signal itself, avoids spurious terms caused by interference 
between positive-frequency and negative-frequency components. 

Every TFD that we have derived heuristically is quadratic in the signal ; that is, 
if the signal is scaled by a factor k, the TFD is scaled by a factor k 2 . This is to be 
expected because 

(a) Each TFD considered so far is related to some sort of energy density. For 
example, the Page distribution is defined as a gradient of energy, hence ac- 
commodating for the negative values that occur in the TFD. 

(b) The signal has been assumed to be an effort variable or a flow variable (ex- 
amples of effort variables are voltage, force, and pressure; the corresponding 
flow variables are current, linear velocity, and volume velocity), and power is 
proportional to the product of an effort variable and the corresponding flow 
variable, hence (in a linear system) to the square of the effort variable or of 
the flow variable. 

(c) We have seen in Section 2.7 that every TFD considered in this chapter can be 
written as the FT of a smoothed IAF [Eq. (2.7.1)], which is the convolution of 
an auxiliary function (the time-lag kernel filter G(t, r)) and the ordinary IAF 
K z (t, r), which in turn is quadratic in the signal. 

Eq. (2.7.4) effectively defines a TFD in terms of its time-lag kernel and is the key 
to the theory and design of quadratic TFDs, as will be detailed in the next chapter. 
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Fig. 2.7.1: TFDs of a linear FM signal with displayed duration 65 samples (sampling rate 1Hz), unit 
amplitude from sample 9 to sample 57, zero amplitude elsewhere, frequency range 0.1 (at sample 1) 
to 0.4 (at sample 65): (a) Wigner-Ville; (b) Spectrogram, 21-point rectangular window; (c) Page; 
(d) Levin; (e) Windowed Levin, 19-point Hamming window; (f) Choi-Williams, a = 4;. . . [continued]. 
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Fig. 2.7.1: [continuing] ...(g) Born-Jordan, a = 1/2; (h) Zhao-Atlas-Marks, a = 2, < 72 ( 7 ") = 1 

(i) Zhao-Atlas-Marks, a = 2, giij) = 19-point Hamming window; (j) Doppler-independent 
g 2 (r) = 29-point Hamming window; (k) Lag-independent, gi(t) = 9-point Hanning; (I) Separable ker 
nel, gi(t) = 9-point Hanning, g 2 ( r) = 29-point Hamming. 





56 



Chapter 2: Heuristic Formulation of TFDs 



[4] S. D. Conte and C. de Boor, Elementary numerical analysis: An algorithmic approach. 
Tokyo: McGraw-Hill, 3rd ed., 1980. 

[5] B. Boashash, “Note on the use of the Wigner distribution for time- frequency signal 
analysis,” IEEE Trans. Acoustics, Speech, & Signal Processing, vol. 36, pp. 1518-1521, 
September 1988. 

[6] J. Villc, “Theorie et applications de la notion de signal analytique,” Cables et Trans- 
missions, vol. 2A, no. 1, pp. 61-74, 1948. In French. English translation: I. Selin, Theory 
and applications of the notion of complex signal, Rand Corporation Report T-92 (Santa 
Monica, CA, August 1958). 

[7] L. Cohen, “Time-frequency distributions — A review,” Proc. IEEE, vol. 77, pp. 941-981, 
July 1989. Invited paper. 

[8] B. Boashash and P. J. Black, “An efficient real-time implementation of the Wigner-Villc 
distribution,” IEEE Trans. Acoustics, Speech, & Signal Processing, vol. 35, pp. 1611-1618, 
November 1987. 

[9] B. Boashash and H. J. Whitehouse, “High resolution Wigner-Ville analysis,” in Eleventh 
GRETSI Symp. on Signal Processing and its Applications, pp. 205-208, Nice, France, 1-5 
June 1987. 

[10] H. .J. Whitehouse, B. Boashash, and .J. M. Speiser, “High-resolution processing tech- 
niques for temporal and spatial signals,” in High-resolution methods in underwater acous- 
tics (M. Bouvet and G. Bienvenu, eds.), ch. 4, pp. 127-176, Berlin: Springer, 1991. 

[11] W. Martin, “Time- frequency analysis of random signals,” in Proc. IEEE Internat. 
Conf. on Acoustics, Speech and Signal Processing (ICASSP’82), vol. 3, pp. 1325-1328, 
Paris, 3-5 May 1982. 

[12] D. Gabor, “Theory of communication,” J. IEE, vol. 93(111), pp. 429-457, November 
1946. 

[13] G. R. Putland and B. Boashash, “Can a signal be both monocomponent and 
multicomponent?,” in Third Australasian Workshop on Signal Processing Applications 
(WoSPA 2000), Brisbane, Australia, 14-15 December 2000. Paper no. 32. 

[14] W. Koenig, H. K. Dunn, and L. Y. Lacy, “The sound spectrograph,” J. Acoustical 
Soc. of America, vol. 18, no. 1, pp. 19-49, 1946. 

[15] J. Imberger and B. Boashash, “Application of the Wigner-Ville distribution to tem- 
perature gradient microstructure: A new technique to study small-scale variations,” J. of 
Physical Oceanography, vol. 16, pp. 1997-2012, December 1986. 

[16] L. R. O. Storey, “An investigation of whistling atmospherics,” Phil. Trans. Roy. Soc., 
vol. A246, pp. 113-141, 1953. 

[17] B. Boashash, “Time- frequency signal analysis,” in Advances in Spectrum Analysis 
and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: 
Prentice-Hall, 1991. 

[18] D. L. Jones and T. W. Parks, “A high-resolution data-adaptive time-frequency repre- 
sentation,” IEEE Trans. Acoustics, Speech, & Signal Processing, vol. 38, pp. 2127-2135, 
December 1990. 

[19] M. K. Emresoy and A. El-Jaroudi, “Iterative instantaneous frequency estimation and 
adaptive matched spectrogram,” Signal Processing, vol. 64, pp. 157-65, January 1998. 




Summary and Discussion 



57 



[20] J. Kay and R. Lerner, Lectures in Communications Theory. McGraw-Hill, 1961. 

[21] C. W. Helstrom, “An expansion of a signal in Gaussian elementary signals,” IEEE 
Trans. Information Theory, vol. 12, pp. 81-82, January 1966. 

[22] I. Daubechies, “The wavelet transform: A method for time-frequency localization,” 
in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 8, 
pp. 366-417, Englewood Cliffs, NJ: Prentice-Hall, 1991. 

[23] Y. Meyer, Wavelets: Algorithms and applications. Philadelphia, PA: Soc. for Indus- 
trial and Applied Mathematics, 1993. Translated and revised by Robert D. Ryan. Original 
French title: Ondelettes et algorithems concurrents. 

[24] S. G. Mallat, A Wavelet Tour of Signal Processing. San Diego / London: Academic 
Press, 2nd ed., 1999. 

[25] C. H. Page, “Instantaneous power spectra,” J. of Applied Physics, vol. 23, pp. 103-106, 
January 1952. 

[26] A. W. Rihaczek, “Signal energy distribution in time and frequency,” IEEE Trans. 
Information Theory, vol. 14, pp. 369-374, May 1968. 

[27] J. G. Kirkwood, “Quantum statistics of almost classical ensembles,” Physics Review, 
vol. 44, pp. 31-37, 1933. 

[28] L. Cohen, “Introduction: A primer on time-frequency analysis,” in Time- Frequency 
Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 1, pp. 3-42, Mel- 
bourne/N.Y.: Longman-Cheshire/Wiley, 1992. 

[29] M. J. Levin, “Instantaneous spectra and ambiguity functions,” IEEE Trans. Informa- 
tion Theory, vol. 10, pp. 95-97, January 1964. 

[30] H. Margenau and R. N. Hill, “Correlation between measurements in quantum theory,” 
Progress of Theoretical Physics, vol. 26, pp. 722-738, 1961. 

[31] G. R. Putland. personal communications, Signal Processing Research Centre, Queens- 
land University of Technology, 2001. 

[32] T. A. C. M. Claasen and W. F. G. Mecklenbrauker, “The Wigner Distribution — A 
tool for time-frequency signal analysis; Part 3: Relations with other time-frequency signal 
transformations,” Philips J. of Research, vol. 35, no. 6, pp. 372-389, 1980. 

[33] H.-I. Choi and W. J. Williams, “Improved time-frequency representation of multi- 
component signals using exponential kernels,” IEEE Trans. Acoustics, Speech, & Signal 
Processing, vol. 37, pp. 862-871, June 1989. 

[34] Y. Zhao, L. E. Atlas, and R. J. Marks II, “The use of cone-shaped kernels for gener- 
alized time-frequency representations of non-stationary signals,” IEEE Trans. Acoustics, 
Speech, & Signal Processing, vol. 38, pp. 1084-1091, July 1990. 

[35] L. Cohen, “Generalized phase-space distribution functions,” J. of Mathematical 
Physics, vol. 7, pp. 781-786, May 1966. 

[36] M. Born and P. Jordan, “Zur quantenmechanik,” Zeitschrift fur Physik., vol. 34, 
pp. 858-888, 1925. 




58 



Chapter 2: Heuristic Formulation of TFDs 




